MOD-GAUSSIAN CONVERGENCE AND THE VALUE DISTRIBUTION OF 
C(l/2 + it) AND RELATED QUANTITIES 
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Abstract. In the context of mod-Gaussian convergence, as defined previously in our work 
with J. Jacod, we obtain asymptotic formulas and lower bounds for local probabilities for a 
sequence of random vectors which are approximately Gaussian in this sense, with increasing 
covariance matrix. This is motivated by the conjecture concerning the density of the set of 
values of the Riemann zeta function on the critical line. We obtain evidence for this fact, and 
derive unconditional results for random matrices in compact classical groups, as well as for 
certain families of L-functions over finite fields. 



It is well-known (see, e.g., [301 Th. 11.9]) that, for 1/2 < a < 1, the set of values ((a + it), 
t € R, is dense in the complex plane. In fact, much more is true: it was proved by Bohr and 
Jessen that there exists a Borel probability measure ^ on C, such that the support of \x a is 
the whole complex plane, and such that the convergence in law 



holds for / : C — > C continuous and bounded. 

The corresponding density question for a = 1/2 is, however, still open (it was apparently 
first raised by Ramachandra during the 1979 Durham conference, but seems to appear in print 
only in Heath-Brown's note in |30t 11.13]): the difficulty is that the values C(l/2 + it), |t| ^ T, 
do not have a limiting distribution, as evidenced already by the Hardy-Littlewood asymptotic 



or by Selberg's result that log|£(l/2 + it)|, \t\ ^ T, is asymptotically normal with variance 
growing to infinity (see also the work of Ghosh [12] for the imaginary part, and [U §5] for a 
recent proof). In other words, "most" values of ^(1/2 + it) are rather large, though the zeta 
function is zero increasingly often as the imaginary part grows. 

In this paper, we show (Corollary [9]) how the density of values of zeta on the critical line 
would follow rather directly from a suitable version of the Keating-Snaith moment conjectures, 
which we viewed in our previous work with J. Jacod [15] as a refined version of the Gaussian 
model. In fact, under suitable assumptions, we could prove a quantitative result, bounding from 
above the smallest t ^ for which C(l/2 + it) lies in a given open disc in C. This argument is 
based on a very general probabilistic estimate proved in Section [21 which throws some light on 
the nature of the mod-Gaussian convergence that we defined in [15]. We hope that this result 
will be of further use. In another paper (jointly with F. Delbaen, see [8]), it will be seen that 
one can weaken considerably the assumption needed in order to prove the density of values of 
C(l/2 + it) (but without quantitative information). 

As applications of the general result, we will also prove the following theorems in Section [3] 
(the precise versions are given there). 
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1. Introduction 






Theorem 1. Let zq £ C x be arbitrary, e > such that e ^ \zq\. There exists Nq(zo,e), which 
can be bounded explicitly, such that 

m ({geU(N) | |det(l-< 7 )-z |<e})»(^) 2 r ^T 7 (1) 

V | Zq I / log N 

provided N ^ Nq, where fijy denotes probability Haar measure on the unitary group U(N) C 
GL(N,C), and the implied constant is absolute. 

Theorem 2. Define 

p N (t)= na-jr 172 -*)- 1 (2) 

for N ^ 1 and t G R. Let zq £ C x be arbitrary, e > suc/i i/iat e ^ There exists Nq(zq, e), 
explicitly bounded, such that 

liminf ^\({t < T | P N (t) V ' 



T->+ooT MzoK log log iV 

/or aZZ AT ATo, where A is i/ie Lebesgue measure and the implied constant is absolute. 

In a different direction, we obtain some evidence for the density of ^(1/2 + it) by looking 
at special values of families of L-functions over finite fields. In doing so, we also consider the 
analogue of Theorem Q] for symplectic and orthogonal matrices. We refer to Section U] for precise 
statements and definitions, and only state here one appealing (qualitative) corollary: 

Theorem 3. The set of central values of the L-functions attached to non-trivial primitive 
Dirichlet characters ofF p [X], where p ranges over primes, is dense in C. 

Notation. As usual, \X\ denotes the cardinality of a set. By / <C g for x £ X, or / = O(g) 
for x £ X, where X is an arbitrary set on which / is defined, we mean synonymously that there 
exists a constant C ^ such that ^ Cg(x) for all x £ X. The "implied constant" refers 

to any value of C for which this holds. It may depend on the set X, which is usually specified 
explicitly, or clearly determined by the context. Similarly, / xj for x £ X means / <C g and 
g <C /, both for x £ X. We write (x)j = x(x + 1) • • • [x + j — 1) the Pochhammer symbol. 
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2. Mod-Gaussian convergence and local probabilities 

In this section, which is purely probabilistic, we present two versions of "local" bounds 
for probabilities in the case of sufficiently uniform mod-Gaussian convergence of sequences of 
random vectors. This may be compared with the local central limit theorem (see, e.g., [21 §10.4] 
for the one-dimensional case). In fact, as F. Delbaen pointed out, one can obtain qualitative 
statements that generalize both the standard local central limit theorem and recover our results 
below under weaker assumptions. However, our emphasis is on explicit quantitative lower 
bounds for local probabilities. 



We first introduce the definition, generalizing [15] to random vectors. Fix some integer 
m ^ 1, and let (Xjy) be a sequence of R m -valued random variables denned on a probability 
space (f2,£,P) (as is the case for convergence in law, we could work without change with 
random variables defined on different probability spaces). Let 

QN(t) = QN(ti,h, ■ ■ ■ ,t m ) 

be a sequence of non- negative quadratic forms on R m . The sequence (A^v) is then said to be 
convergent in the mod-Gaussian sense with covariance Qn and limiting function <3? if 

lim exp(Q JV (t)/2)E(e i *-^) = *(t) (3) 

N— >+oo 

locally uniformly for t £ R m ; $ is then a function continuous at and ^(0) = 1. Here, • denotes 
the standard inner product on R' m . 

The intuitive meaning is that, in some sense, Xn is "close" to a (centered) Gaussian vector 
Gn with covariance matrix Qn- As in [T5], this notion is of most interest if the covariance 
"goes to infinity". However, in contrast with the case of m = 1, this can mean different things 
because there is more than a single variance parameter involved. 

To discuss this, we diagonalize Qn in an orthonormal basia3 in the form 

Qiv(i) = 5i )N ul H h 5 m ^ N u 2 m , ^ <5i jA r ^ 5 2 ,n ^ ^ 5 m ,N 

where u = HN(t) is the necessary (orthogonal) change of variable. Then u Qn goes to infinity," 
in the weakest sense, means that the largest eigenvalue 5 m ^ n goes to +oo as N — > +oo. 

We are interested in the distribution of values of Xn as N grows; clearly, if (say) the Qn 
are already diagonalized in the canonical basis and S\ t N is constant, these values will have first 
coordinate much less spread out than the last ones. To simplify our discussion, and because 
this is the situation in our applications, we will assume this behavior does not occur and that 
in fact the smallest eigenvalue goes to infinity. For simplicity, we will assume in fact that for 
some fixed n > 0, we have 

Sm,N ^ $i iN , Sm t N ~> +OO, (4) 

so that also Sin ~^ +°° ( we sav that the convergence is balanced). In our main applications, 
this will be the case with fi = 1. Note in particular that it follows that the discriminant 

= Si,N ■ ■ ■ S m ,N ^ S™ N 

goes to infinity as N — > +00, and moreover 

cjv «S S™n- (5) 

Our question is now the following: given (Xn), (Qn), as above, with this type of mod- 
Gaussian convergence, can we bound from below the probability 

P(X N G U), 

where U is a fixed open set in R m ? 

Denoting by Qn(x) the dual quadratic form, the Gaussian model suggests that, if U is 
relatively compact (e.g., some non-empty open ball), we could expect 

P,X„ e V) „ P( K e V) . ^= I e~^ ix . ^L=, (6) 

as iV +00, since S\ t N +°o implies that Qn(x) — > for all x £ R m . We will confirm that 
this holds in certain conditions at least. We strive especially for lower bounds on ~P(Xn £ U), 
which we wish to be quantitative, so that we can determine some Nq (depending explicitly on 
U) for which 

P(X No eU)>0. 

Note that such a quantitative result must depend on the location of the open set U, whereas 
the limit itself only depends on the volume, as seen above. 



With respect to the standard inner product on R m . 
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The specific hypothesis we use may seem somewhat arbitrary, but they turn out to be satisfied 
(with room to spare) in the later applications. 

Theorem 4. Let m ^ 1 be fixed and let (-Xjv) be a sequence of H m -valued random variables 
defined on (O, S,P) ; such that (Xjv) converges in the mod-Gaussian sense with covariance 
(Qn), and that the convergence is ^-balanced with fi > 0, with o~n ^ 1 for all N . Let (G/v) be 
Gaussian random variables with covariance matrices given by (Qn), so that 

exp(-Q N (t)/2)=E(e u - GN ). 

Assume moreover the following three conditions: 

(1) There exist constants a > 0, a > and C > such that, for any N ^ 1 and t G R m such 
that \\t\\ ^ crff, we have 

E(e ux -) = Ht)eM-QN(t)/2){l + 0( / )}. (7) 

(2) The function $ is of class C 1 on {\\t\\ < 2}. 

(3) For some A ^ 1 and (3^0, we have 

|$(t)|<exp(/3||t|| A ), (8) 

forte R m . 

Let D > be any number such that 

D > 2(m + 1 + max{a-\ A/C, 3m(m + 1)/M}). (9) 

Then, for any fixed non-empty open box 

U = {x G R m | \\x- xolloo < e} C R m , 

with width e such that < e ^ 1, we have 

p(x N gu) = p(g n e co + o( 1/2+1/J + ■— J , (10) 

0~N N 

for N ^ 1, where the implied constant depends only on (m, a, a, C) and i/te implied constant 
in ©. 

In particular, for any fixed non-empty open set U C R m , we Ziaue 

P(X W G U) > ' 



'0~N 

provided N ^ iVo, where Nq and the implied constant depend on U and the same data as above. 
Note the following elementary lower bound, valid if e ^ 1: 

-.m 



P(G iV GC/)»4 = exp(-^l) (11) 

where the implied constant depends only on m; this is where the location of U enters, since the 
error term will only be smaller than this, roughly, when Qn(xo) ~ 1- 

Remark 1. The growth condition ([8]) is in fact a consequence of the uniform mod-Gaussian 
convergence, at least provided the sequence (<r/v) does not grow too fast. For instance, if 

<T N+ i ^ Ma% (12) 

for all N ^ 1, for some constants M ^ 1, B ^ 0, we can obtain ([8]) with A = 2 + £?/a. Indeed, 
we can write 

|$(t)|= § N (t)e Q »®' 2 (l + o( ?—rr)) <2e^W/ 2 

V \exp(acr]y)/ / 

if iV is large enough and ||i|| ^ aj^. Note then that 

Qjv(i) ^ 5 mj Ar||t|| 2 < (5^Ar||t|| 2 < CT^ ||t|| 2 . 
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We now fix N ^ 1 minimal such that 

CJV-l < ||*|| ^ a N, 

and if this value of TV" is large enough, we get o~n ^ C°"jv_i ^ M||t|| B//a , and hence 

|$(t)| ^ 2exp(a7v||t|| 2 ) < 2 exp(M||t|| 2+B / a ), 

as desired. On the other hand, if this chosen N is too small, ||t|| is bounded, and the desired 
estimate is trivial. 

Proof of Theorem^ Let 5n = 5i,jv be the smallest eigenvalue of Qn, so that Qjv(*) ^ ^at||*|| 2 
for all t G R m and N ^ 1. For simplicity, we denote also 

7at = exp(aa^), (13) 

as in 0. 

We now first fix w such that < w < 1, and then fix a smooth, compactly supported function 
go on R such that 

< go < 1, 

ffo(^) = for \x\ ^ 1, go(x) = 1 for |z| ^ w, 

\g ( j \x)\ <ZjA>, for j >0, a; G R, 

where A = (1 — it;) -1 and the implied constant depends only on j (we will define w to be a 
function of N at the end, and hence must be careful to have estimates uniform in terms of w; 
this is provided by using only the above properties of go; the maximal value of j used will also 
be bounded only in terms of m, $ and the data in ©). It is classical that such a function exists 
(examples are constructed in |13l §1-4]). Then define 

fo{x) = fo{xi,...,x m ) = Y[ 9o(xj) 

for x G R m . It follows that 

< fo < 1, 

fo(x) = for H^Hoo ^ 1, fo(x) = 1 for \\x\\oo < w. 

Next, we define 

/(x)-A(^), 

and we start our argument with the obvious inequality 

P(X N eU)^ E(f(X N )) = [ f(x)du N (x) 

JB. m 

where vn is the law of X^. Applying the Plancherel formula, we get 

P(X N G U) > ! f(x)dv N (x) = j f(t)$ N (t)dt 

JR. m JR™ 

where $jv(*) = E(e* w ) is the characteristic function of Xjy and 

/(*) = f f(x)e-*"dx 



(2tt)-/2 y R „ 

denotes the Fourier transform of / (the smoothness of / guarantees that / is in L 1 , so the 
Plancherel formula is valid by a simple Fubini argument). 
We have 

f{t) = e m e- it ^fo[et)= e m e- it ^ J] go(etj), t G R m . 

Since 
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for t 7^ and j ^ (by repeated integration by parts), we find that 

|Stt(t)| «min(l,A^P), 

the implied constant depending on j. 

Using the formula for f(t), selecting for given t an index j so that ||t||oo = an d applying 
the second upper bound above for this index only, if \tj\ ^ 1, we derive 

\f(t)\ <min(e m ,A B+m e- B ||t||- B - m ), t € R m , (14) 

for any fixed B ^ 1, where the implied constant depends only on m and B. In particular, for 
||*|| oo ^ 1) we will use simply the upper bound ^ £ m ||/o||oo ^ £ m - 

We now proceed to approximate. First of all, for any radius Rn ^ 1, we can estimate the 
contribution of those t with \\t\\ > Rn using the estimate above with a value of B ^ 1 which 
will be determined later. After integrating over ||t||oo > Rn, we obtain 



/ f(t)$ N (t)dt « e- B A m+B R N B 

J\\t\\>R N 

for any Rn j3 1- After selecting Rn = cr^ ^ 1, we obtain 

/ f{m N {t)dt^e- B A m+B a N \ (15) 
J\\t\\>R N 

for N ^ 1, the implied constant depending only on /o. 

On the other hand, provided B > 1/a, we use (|7|) and flSJ) to get 



L 

J \\t CR 



dt 



f(t)<Z> N (t)dt= [ f(t)$(t)exp(-Q N (t)/2)h + o(— 

exp(-Qjv(t)/2)dt + 0( 7 ^ / Mt)f(t)\dt 

t\\^R N v 

/(t)$(t) exp(-Qjv(t)/2)dt + 0{e m lN l exptfRfr)) 

for iV ^ 1, using again the definition of /, the implied constant depending on <3?. 
By (fl~3|) , the last term can be bounded by 

e m 7^ exp(/3i4) < e m exp(/34 /B " < 

for iV ^ 1 if B > A/C, the implied constant depending on (a, A, B, C). 

We then split the first term further in two parts, namely where Qjv(*) ^ k at> an d where 
Qtv(*) > k%- The parameter kn will be chosen later, in such a way that the region {||t|| ^ 1} 
(which is inside {||i|| ^ Rn}) contains the region Qjy(t) ^ k n (which is a neighborhood of 
that contracts to as N — > +00, if kn does not grow too fast, since it is an ellipsoid with 
longest axis Kjv/v/^nviv)- 

The second part of the integral is bounded by 

f(t)<S>{t)e- QN ^' 2 dt < e m R%exp((3R N - k%/2) 

t\\^R N , QN{t)>K% 



m ( m 1 , q A/B K N 

e exp log a N + pa N ' - -jj- 



and in the first part, we use the approximation 



<&(*) = l + 0(||t| 
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for {||t[| ^ 1}, coming from the C 1 assumption on <£, to get 

f{t)^{t)e- QN{t)l2 dt = [ f(t)e- QNit)/2 dt + o( f \f(t)\\\t\\dt) 



f {t)e -Q N (t)/2 dt + ^_^\ (16) 
2 V yJd N a N ) 



the implied constant depending on <3?, where the last integral was estimated using 



- ^ r\ u\ ^ i • i - Kn 



1*11 < t-QaK*) < N < 



We can rewind the computation for the first term, with the Gaussian Gn instead of for 
N ^ 1, we have 



/ f{t)e- QN ^/ 2 dt = E(/(GW)) + 0(£ m e- K ^/ 2 



where the implied constant is absolute, and then we write 

E(/(GW)) = P(||GW - xolloo < e) + 0(P(we < ||GW - *o||oo < e)) 

= P(\\G N - xolU < e) + 0(e m (l - w)a N 1/2 ) 

for N ^ 1, where the implied constant depends only on m (the last step is obtained using the 
density of the Gaussian Gn)- 
Summarizing, we have found 



P(X N eU)^ P(G N £U) + 



e m (l - w) A m+B 



rn, . „ a/b K Af^ i £ k n 



(where we recall that A 1 = 1 — w). 
Now if A/B < 1 /(m/i), and 

we have first (see (0)) the condition 



+ e -exp( -logajv + ^'-^f + T- 
-D z / V&non 



A/B 



{Qjv(f)< 4} c {||t|| 2 <5r> c {\\t\\^ l}, 

Oat 

and moreover the third error term is absorbed in the second one, while the last becomes 

_1_ 1 (m+l)A 
N < £ m a 2 2 m(1 + B 

Thus if we select any B > max(a _1 , A/C, 3m(m + l)Apk), the previous conditions on B hold, 
and we find that 

/£ m (l- W ) A m+B e m \ 

P(X N G EO > P(G N 6 IT) + 0(^7^ + ^— + 1/2+1/(6m J • (17) 

Now, we attempt to select w to equalize the error terms involving it, i.e., so that 

e m {l - w) _ A m+B 

which translates to 

A = (1 - W)- 1 = ^^fKrn+B+l) 

and two cases arise: 



(i) If a l Jj 2 > e 2m , we have A > 1 (as is necessary to define w in this way), and we obtain 
from the above that 

P(X N eU)>P(G N eU) + 



1/2+1/0, 
N 

where D = 2(m + l + B) (note that, since A is assumed to be ^ 1, we have 2(m + B + l) > 6m/j.) 
r 1/S 



(ii) If we have a 1 ^ 2 ^ e 2m , we take simply w = 1/2 and obtain 



e m (l - to) A m+B 

+ — < 



where the implied constant depends on m and B. 

The combination of the two cases leads to the lower-bound in (|10p. The upper bound is 
proved similarly, using instead of go a function which is = 1 for \x\ ^ 1 and = for \x\ ^ 1 + w 
for some suitable w > 0; we leave the details to the reader. □ 

Remark 2. If we are interested in obtaining a lower bound only (which is the most interesting 
aspect in a number of applications), it is simpler and more efficient to fix, e.g., w = 1/2, from 
the beginning of the proof. For 

U = {x G R m | \\x - xqWoo < e}, 

this leads for instance to 

P(X N e U) > P(G N e U.) + 0(- TJ ^-^) (18) 

for any 7 > (by taking B large enough depending on 7), where 

U- = {x G R m I \\x - xolloo < e/2} 
and the implied constant depends on <E>, (m, o, q, C) and 7. 

Remark 3. From the probabilistic point of view, this proposition gives one answer, quantita- 
tively, to the following type of question: given a sequence (Xn) of (real- valued) random variables 
and parameters — > +00 such that Xn/&n converges in law to a standard centered Gaussian 
variable (with variance 1), to what extent is Xn itself distributed like a Gaussian with variance 

Here, we perform the comparison by looking at P(Xtv G U), U a fixed open set. And this 
shows clear limits to the Gaussian model: for example, any integer-valued random variable X^ 
will have P(Xjy G U) = for any open set not intersecting Z, and yet may satisfy a Central 
Limit Theorem (e.g., the iV-th step of a symmetric random walk on Z). Even more precisely, 
denoting 

d K (X, Y) = sup I P(X ^x)- P(Y ^ x)\ 

the Kolmogorov distance, there exist integer-valued random variables Xn with 

d K (X N ,G N ) ^N- 1 / 2 , (19) 

where Gat is a centered Gaussian with variance N, which indicates a close proximity - and 
yet, again, P(Xn G U) = if U n Z = 0. For instance, take Xn with distribution function 
F N {t) = P(X N ^ t) given by 

F N (t) = P(G N < fc) for fe < t < fc + 1, k G Z. 

In particular, it is also impossible to prove results like Theorem H] using only this type of 
assumption on the Kolmogorov distance. Of course, if we assume that 

d K {X N , G N ) < N-WftN)- 1 , (20) 

with 4>(N) — > +00 (arbitrarily slowly), we get 

P(X N eU)^ P(G N G U) - 2d K (X N ,G N ) » N- 1 ' 2 , 



for U = [a, f3], a < /3. But such an assumption is unrealistic in practice. For instance, if one 
assumes that (Xn) converges in the mod-Gaussian sense with covariance Q7v(i) = &Nt 2 , and if 
the limiting function <E> is C 1 and the convergence holds in C 1 topology, one can straightforwardly 
estimate the Kolmogorov distanc^ by 

d K (X N , G N ) = sup | P(X N < x) - P(G N ^ x)\ < a N 1/2 , 

which is comparable to (|19p . but one can also check that this can not be improved in general 
to something like (|20p . 

In Example[l]in Section[3l we will also describe a much deeper and more illuminating situation 
concerning the limits of what can be hoped, even with something like mod-Gaussian convergence. 

Remark 4. Other variants could easily be obtained. In particular, it is clear from the proof that 
if <3? has sub-gaussian growth, i.e., we can take A = 2 in ([5]), the results can be substantially 
improved. However, in our main applications, this condition fails. Also, one could use test 
functions / which decay at infinity faster than polynomials to weaken the uniformity requirement 
in the convergence condition (|7|) (for instance, for m = 1, it is possible to find / which is smooth, 
non-negative and compactly supported in any fixed open interval and satisfies 

/(t)«exp(-|t| 1 - £ ) 

for any e > 0, as constructed, e.g., in [131 Th. 1.3.5] or [2]). Again, for our main unconditional 
applications, our conditions hold with room to spare, so we avoided this additional complexity. 

Remark 5. We can also introduce a linear term (corresponding roughly to the expectation of 
Xjy) in addition to the covariance terms in the definition of mod-Gaussian convergence (as 
in [15] for m = 1), but this amounts to saying that (AT/v) converges in the mod-Gaussian sense 
with covariance (Qn) and mean (£jv), £,N £ R"\ if the sequence (Xjy — £jv) converges in our 
original sense above. But note that the interpretation of a lower bound for P(X^ — £n £ U), 
as given by Theorem 0] for a fixed U C R m , is quite different, if £tv is itself "large". Maybe 
one should see the statements in that case as giving natural examples of sets Un for which one 

— 1/2 

knows that P(Ajv G £/jv) has the specific decay behavior N -> +oo. 

In particular, lower bounds for P(Ajv — £n £ U) do not give control of P(X^ G U), and 
indeed this may be zero for all N large enough (see, for instance the example in Section 14.21 
below of values at 1 of characteristic polynomials of unitary symplectic matrices, which is always 

3. Random unitary matrices and the zeta function 

We present now some applications of Theorem [5] (in particular, proving Theorems [1] and [2|) . 
We also give an example that illustrates the limitations of such results, suggesting strongly that 
one can not replace mod-Gaussian convergence with the existence of the limits Q only for t in 
a neighborhood of the origin. 

Example 1. One of the canonical motivating examples of mod-Gaussian convergence is due to 
Keating and Snaith [22J. Let 

A 7V = logdet(l-T 7V ), 

where T/v is a Haar-distributed random unitary matrix in the compact group U(N); we view 
these random variables as R 2 -valued (via the real and imaginary parts), so if t = (ii , ^2)? we 
have 

t-X N = t 1 Re(X N )+t 2 lm(X N ). (21) 

We first clarify the choice of the branch of logarithm: Xn is defined almost everywhere (when 
1 is not an eigenvalue of Ijy), and for g £ U (N) with det(l — g) 7^ 0, such that 

det(l-T<?)= J] (1-ajT), |oy| = 1, 



Using the Berry-Esseen inequality, see e.g. [291 §7.6]. 
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we define 



log det(l — g) = lim logdet(l — rg) = > lim log(l 

r->l ' „ ^—^„, r— >! 



ra 



3)1 



r<! Kj<N r <l 

where the last logarithms are given by the Taylor expansion around 0. This is the same con- 
vention as in [22J P ar - after (7)]. 

Keating and Snaith show that (-Xjv) satisfies 

E(e it-x N)= tt rc/TO+jti) 

1 i J i N nj + Wti+t2))r(j + Uiti-t 2 )) { ' 

for t = (tx,t2) G R 2 (note the asymmetry between zti and £2), see [22j eq. (71)], taking into 
account a slightly different normalization: their t is our it\ and their s is our £2- It is also 
useful to observe that, in the connection with Toplitz determinants (see [3]), this characteristic 
function is the iV-th Toplitz determinant corresponding to the symbol which is a pure Fisher- 
Hartwig singularity of type 

b( e ie ) = (2 - 2 cos #)*' l/2 exp(i(# - n)t 2 /2), < 6 < 2vr 

(this is denoted 

tt 2 /2(e id )u itl/2 (e ie ) = iiit^t 2 )/2{e ie )^t 1+ t 2 )/2{e ie ) 
in [TU], where the formula (|22|) is stated as Eq. (41); see [B] for two elementary computations 
of the corresponding Toplitz determinants.) 

We rewrite this in terms of the Barnes function G(z), as is customary. We recall that G is 
an entire function of order 2, such that G(l) = 1 and G(z + 1) = T{z)G{z) for all z, and that 
its zeros are located at the negative integers. In particular, it satisfies 

for all N ^ 1 and 6 £ C. 
Thus, we have 



Wp u-x N) _ tt mrV + iti) G(1 + ^)G(1 + ^ 

> 11 tV„- 1 iti+t 2 \r(A 1 it±-t 2 \ a(\+it x ) 

G(l + it! + N)G(1 + N) 



G(l + + N)G(l + + N) 
(see, e.g., [333 eq. (41)]). We now get from [333 Cor. 3.2] that 



(7(1 + «izia)G!(l j. «i±*a- 



2 /"v- 1 1 2 



G(l + tti 

gh 4- ^ 

exp(-Q N (t)/2 



G(l + ^f^)G(l + 



G(l + iti) 
where 

QN(h,t 2 ) = 6 N (t 2 1 +t 2 2 ), 5 N = ±logN, 
hence we have complex mod-Gaussian convergence with limiting function 

*W'* 2 )- G(l + it!) • (24) 

Here we can take fi = 1 in @. 

Remark 6. Note that this is noi the product of the two individual limiting functions for mod- 
Gaussian convergence of the real and imaginary parts of Xn separately (which are 3> s (ti, 0) and 
^9(0,^2)); although after normalizing, one obtains convergence in law of 

(Re(X N )/y/5^, lm(X N )/y^) 
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to independent standard Gaussian variables, as noted by Keating and Snaith. 

We now check that Theorem 0] can be applied to the sequence of random variables (Xn). The 
fact that & g is of class C 1 on R 2 is clear in view of the analytic properties of the Barnes function. 
Condition (3) is also obvious. A uniformity estimate like is not found in [22] or [10], though 
it is proved for t in a fixed compact region of R 2 in [10, Cor. 3.2]. In Proposition E2 in the 
Appendix, we prove 

$ jV (t) = $(t) e -^w/ 2 fi + of 1 + l|/| " 



N 

for ||t|| A 1 / 6 . In view of on = (logA) 2 /4, this is compatible with ([7]) and (fT3|) . with a 
arbitrarily large, C = ^ and A (defined by ([5])) can be taken to be any A > 2. Thus the 
constant D in (|10p can be any number 

D > 2(3 + max(4,36)) = 78. 

Moreover, since 



r 2 , Jl 

Qn(xi,x 2 



i log N' 

we see by using Remark [2] (namely, (|18p ) and (jlip that we have the following corollary: 
Corollary 5. For < e < 1 we have 

P{\X N -z \ <e) »e 2 (logA)- 1 (25) 

for all N with 

N > max{exp(|zo| 2 ),exp(C;T~ 9 )} 
where both implied constants are absolute. 

(The first condition on N ensures the main term is S> £ 2 (log A) , while the second ensures 
that the error term is smaller; we have taken 2/9 = l/(2m/x) — 7 = 1/4 — 1/36 for definiteness; 
any number > 8 can replace 9). 

Now we appeal to the following elementary lemma: 

Lemma 6. Let zq G C x and e > 0, and denote 

wo = log 1 2r 1 +i6 , 0o = Arg(z ) 6] - 7r,7r]. 
Then, provided e ^ \zo\, we have 

\e w - z \ < e 

for all w £ C such that 

e 

\w - w \ < - — 
2|^o| 

For given zq £ C, non-zero, we get from this and ([25]) . applied to log \zo\ + i Arg(zo) and to 
e/jzol instead of e, the following explicit form of Theorem [TJ 



P(|det(l-7jv)-z | <£)> 



Theorem 7. Let zq G C x 6e arbitrary, e > suc/i t/iai e ^ |zo|- have 

e \2 l 
N/ log A 
/or A ^ Ao(zo)^); where 

N (z ,e) < max|exp((log|2;o|) 2 ),exp^C^^-|) ^)| 

where C and the implied constants are absolute. 

Remark 7. It follows from asymptotic formulas for the Barnes function, e.g. [11], that we have 

p^log|<D 9 (t)|xlog(2||t||), 

which is illustrated in Figure 1. This super-gaussian behavior is the main cause of difficulty in 
the proof of Theorem HI 
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Figure 1. Graph of 4 log \<f> g (l,t 2 )\, l^t 2 ^ 140 



Remark 8. Note that if one only wants to say that det(l — Tjv), for N growing, has dense image 
in C, much simpler topological arguments suffice. 

Example 2. Another conspicuous example of mod-Gaussian convergence is the arithmetic 
Euler factor in the moment conjecture for ^(1/2 + it). In [15} §4.1], we considered this factor 
for the real part log + it)\ only, and we first generalize this as in the previous section. 

Consider a sequence (X p ) of independent random variables uniformly distributed on the unit 
circle and indexed by prime numbers, and let 

L7V = -^log(l-p- 1/2 X p ) 

where the logarithm is given here by the Taylor expansion around 0. For each individual term 
E p = — log(l — p -1 / 2 X p ), we have 



E(e^) 



2tt 



2- 



(1 - ae ie )-2^ +it ^(\ - ae- i9 )2^- u ^d9 



with a = p 1 ' 2 . Expanding by the binomial theorem and picking up the constant term in the 
expansion in Fourier series, we obtain 



E(e^) = Y, a 



2/ 



\{t 2 + it x )\ /|(t 2 -ih) 



\- {\{t2 + iti)) 3 {\(iti -t 2 ))j 2j 

2. (in2 



j>0 



2 Fl(i(z'ti +t 2 )Mit 1 -t 2 )-l-a 2 ) 



in terms of the Gauss hyper geometric function. 
Arguing as in |15j. we see now that 



E(e 



it-Lr, 



p^N 



4 p 
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and hence, denoting 

Sn = ~2 ^ ^Si^-P' 1 ) ~ 7^ log log A^, Q N (t) = 5 N {t\ + t 2 ) = 5 N \\t\\ 2 , 

p^N 

we get 

E(e u ' LN ) ~ exp(-Q N (t)/2)$ a (t), 
as N — > +00, with limiting function 

$«(t) = II ( x " -)~" < " /4 2^i(|(^i + *2), - (26) 

We have here also fi = 1 in Now, to check the uniformity required in ([7]), we write 

E(e^) = exp(-Q N (t)/2)* a (t)R N (t) 

with 

«J\r(t)= II l \Fi{\(it l +t 2 ),\(it l -t 2 )-l; p - 1 ). 
p>N P 

If we expand the p-factor using the binomial theorem, we obtain 

l + Y,— Yl ^ itl+t2 ^ a ^ ih ~ t2 ^ a f~^ 2/i 



p> ^ (a!) 2 V b 



and if we assume that ||i|| ^ A with A ^ 1, crude bounds show that this p- factor is 



where the implied constant is absolute, so that if ||i|| ^ A^ 1 ' 8 , for instance, we get 

RN(t)=U (i + o(5>" 3i/4 )) =i + o(n-^). 

p>N j>2 

Although this is crude, it already gives much more than ([7]), both in terms of range of 
uniformity and sharpness of approximation. 

Since Condition (3) is also obviously valid here, Theorem H] (or rather (\18\i ) applies with A 
any real number > 2, a and C arbitrarily large, and shows that 

V{\L N -z Q \ <e)^e 2 (\og\ogN)- 1 

for any zq £ C and e < 1, provided 

A" 3> max exp(exp(|,zo| 2 )), exp(exp(Ce -9 )) 



for some large constant C ^ 1. 

From this we deduce easily the more arithmetic- looking statement of Theorem [2j Indeed, let 
Pjv(i) be given by ([2]). For fixed N, it is well-known that the random variables t 1— > P/v(t) on 
the probability spaces ([0, T],T _1 A) converge in law, as T — > +00, to 

Pn = J] (! -P~ 1,2 Xp)- 1 = exp(Ljv), 

p^N 

where X p are as above (independent and uniformly distributed on the unit circle; the inde- 
pendence is due to the fundamental theorem of arithmetic). For any open set Vq it follows 
that 

liminf ^\({t < T I P N (t) € V}) > P(P N 6 V). 



^ Because we do not know if the probability that Pn is in the boundary of V is zero or not, we do not claim 
or need to claim - an equality; see, e.g., [TJ Th. 2.1, (iv)]. 
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Applying Lemma [6] as in the proof of Theorem we obtain Theorem [2] with 

e \-8 



N (z ,e) <€. max|exp^exp^(log |z |) 2 ^ , exp^exp^C 
for some absolute constant C. 



2*0 



]_ j\u-i og «i/2 +l u) du = $ a ( f ) #s ( t )exp(-|(loglogT))(l + o(l)) 



Remark 9. Again, the density of values of Pjv(t) for iV ^ 1 and t£R (or of P/v, which amounts 
to the same thing) is an easier matter that can be dealt with using topological tools. 

Example 3. The two previous examples are of course motivated by their conjectural relation 
with the behavior of the Riemann zeta function on the critical line (this is the arithmetic essence 
of [22] ). Indeed, Keating and Snaith conjecture that: 

Conjecture 8. Define log ("(1/2 + iu), when u G R is not the ordinate of a non-trivial zero of 
C(s), by continuation along the horizontal line Im(s) = u, with limit when Re(s) — > +oo. 
For any t = (£1,^2) £ R 2 ; we have 

rT , +1 

as T — > +00, where • is the inner product on R 2 as in (|2ip . 

Hence, we see in particular that the following holds: 

Corollary 9. Assume there exist a > 0, 5 > and 9 > such that Conjecture® holds with the 
error term o(l) replaced by 

exp(-a(log logT) 5 ) 

uniformly for \\t\\ (loglog6T) e . Then the set of values ^(1/2 + it) is dense in the complex 
plane. In fact, there exists C > 0, D ^ 0, such that, for any zo G C x and e there exists 

t with 

^ t < max|exp(exp((log|z |) 2 )),exp(exp(^C^^-|-^ ^)|, 

swc/i that 

|C(| + it) -JZb| < e. 



Of course, such a strong conjecture concerning the imaginary moments of ^(1/2 + it) looks 
quite hopeless at the current time: there is no known non-trivial result available, even assuming 
the Riemann Hypothesis. But Example H] below suggests that (with this approach) it is indeed 
necessary to require that the characteristic function converge uniformly for t in a region growing 
with T. In [8], jointly with F. Delbaen, we will explain how the weaker qualitative statement 

\A({u€ [0,T] I C(|+ra)£7})» ' 



T y J ' loglogT 

for a fixed open set V and for T large enough (which of course suffices to give a positive answer 
to Ramachandra's question) can be proved under much weaker assumptions than a uniform 
version of Conjecture El 

Remark. Another remark concerning Conjecture [8] has to do with the factored form of the 
limiting function <& a (t)<& g (t), which seems to imply some asymptotic independence property. 
Recall that the real and imaginary parts of <& a and <3? s are themselves asymptotically independent 
after renormalization, but are not products of the limiting functions for the two parts separately. 
So Conjecture [HI if correct, is evidence of quite particular probabilistic behavior^ 



4 The fourth moment of ("(1/2 + it) and a few other results do provide evidence of a factored limiting function, 
with "random matrix" term split from the Euler factor. The mod-Poisson analogy is also consistent with this, 
in the case of the number of prime factors of an integer, as discussed in detail in [271 §4, 5, 6]. 
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Example 4. Our assumptions in Theorem 2] are probably not optimal. We now describe an 
illuminating (counter)-example in the direction of understanding when a result like this could 
be true. 

We again look at random matrices Tjy in the compact group U(N) (as in Example [TJ, but 
this time we consider the random variables counting the number of eigenvalues in certain fixed 
arcs of the unit circle: fix 7 e]0, l/2[, and let 

I = {e 2i7rd I \6\ < 7} C C. 

Then let Xjy be the number of eigenvalues 1? of T\r such that $ E I. Note that Xjy is an 
integer-valued random variable. It was proved by Costin and Lebowitz that 

X N - 2 7 iV 
vr- 1 v / loglV 

converges in law to a standard normal random variable. Wieand [31] gave a proofl based on 
asymptotics of Toplitz determinants with discontinuous symbols; as noted by Basor, this gives 
the asymptotic 

E / e it(jr w -27JVh „ e X pf-^ i-logiV)(2 - 2cos47t 7 )4^g( 1--)g(i + -) 

as N — > +00, for all t with \t\ < ir (see, e.g., [101 Th. 5.47], applied with N = 2, a\ = 02 = 0, 
Pi = 4=, P2 = — 2IF' ano - *he con dition on t is equation (5.79) in loc. cit., or [3l p. 331]). 
This asymptotic is of course of the form ([3]) for these values of t, but the restriction \t\ < it is 
necessary, since the characteristic function of Xn is 27r-periodic for all N. The convergence is 
sufficiently uniform for t close to to allow the deduction of the renormalized normal behavior 
(as Wieand did, using the Laplace transform instead of the characteristic function), but when 
7 is rational, the set of possible values of Xjq — 2"fN for N ^ 1 is a discrete set in 

4. Distribution of central values of L-functions over finite fields 

We now consider examples related to L-functions over finite fields. Our main input will be 
deep results of Deligne and Katz, and we are of course motivated by the philosophy of Katz 
and Sarnak [21] . 

The goal is to make statements about the distribution of values at the central point 1/2 
of L-functions over finite fields. The appealing aspect is that these form discrete sets, hence 
proving that they are dense in C (as in Theorem [3|), for instance, is obviously interesting and 
meaningful. We consider examples of our results for the three basic symmetry types in turn: 
unitary, symplectic, and orthogonal. For the last two, this means first obtaining a suitable 
analogue of Example [TJ The corresponding limiting functions have already been studied in some 
respect by Keating-Snaith [23] and Conrey- Farmer [7], though our expressions seem somewhat 
more natural. 

4.1. Unitary symmetry. Let F q be a finite field with q elements. Unitary symmetry arises 
(among other cases) for certain types of one-variable exponential sums over finite fields, which 
are associated to Dirichlet characters of F 9 [X], which we now describe; these will lead to a proof 
of Theorem [3l 

Let F q be a finite field with q elements of characteristic p 7^ 0. A Dirichlet character modulo 
g G F 9 [X] is a map 

rj : F q [X] -> C, 



Including a more general result concerning the joint distribution of the number of eigenvalues in more than 
one interval. 

^ For what it's worth, one may mention that the density of values Xn — 2*yN is true for irrational 7, by 
Dirichlet 's approximation theorem, and by the existence of matrices in U(N) where the number of eigenvalues in 
I takes any value between and N. 
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defined by 

. . I if / and g are not coprime 

\v(f) otherwise, 
where r/ is a group homomorphism 

ry : (F q [X\/gF q [X\)* — ► C x . 

This character is non-triviai if 77 7^ 1, and primitive if it can not be defined (in the obvious 
way) modulo a proper divisor of g. The associated L-function is defined by the Euler product 

for s G C, where the product ranges over monic irreducible polynomials in F 9 [X] and \n\ = 
gdeg(7r)_ Q ne g^Q^ q U ite easily that this is in fact a polynomial (which we denote Z(rj,T)) in 
the variable T = q~ s of degree deg(g) — 1, if 77 is primitive modulo g and non-trivial. 

The examples used in proving Theorem [3] arise from the following well-known construction. 
For any integer d ^ 1, with p\ d, any non-trivial multiplicative character 

X ■■ F 9 ^C X 

such that x d 1? an d any squarefree polynomial g G F g [X] of degree d, we let 

x£F q 

where x(0) is defined to be 0. These are multiplicative exponential sums, and have been 
studied intensively, due in part to their many applications to analytic number theory (for their 
generalizations to multiple variables, see the paper [T7] of Katz). 

It is also well-known that one can construct a non-trivial Dirichlet character rj = rj(g,x), 
primitive modulo g, such that 



Z(rj,T) = exp(^ 



m 



where S m (x,g) denotes the "companion" sums over extensions of F q , namely 

s m (x,g)= Yl x(N Fqm / Fq (g(x))), 

where N-p m /F q is the norm map. We will denote L(s,g,x) the corresponding L-function. 

Moreover, we have the Riemann Hypothesis for these L-functions (due to A. Weil), which 
gives the link with random unitary matrices: there exists a unique conjugacy class x ^ g (q) in 
the unitary group U(d — 1) such that 

L(s + l,g,x) = det(l-q- s e Xtg {q)), 

(so that, in particular, we recover the Weil bound 

\S( x ,g)\<(d-l)q^ 2 , 

by looking at the trace of 6 X) g{q)). For all this, one can see, for instance, [26, §4.2], which 
contains a self-contained account. 

We will first prove the following theorem, which is clearly a stronger form of Theorem in 
view of the preceding remarks: 

Theorem 10. For d ^ 1 and t G Z, let g d j = X d - dX - t G Z[X]. For p prime, let X(p) 
denote the set of pairs (x>i) where x i> s non-trivial character ofF p and t G F p . 

Let zq G C x and e > with e ^ \zo\ be given. For all integers d do(zQ,e), we have 

limiKf l{fe f ) £l M I X d ^h\L(^g d , t ,x)-z \ <e}\ ^ / e y 1 
p->+oo |-^(p)| M z oK logci' 
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where 

d (z ,e) < max|exp((log |z |) 2 ),exp^C^|^|-^ )}, 

where C ^ and the implied constants are absolute. 

This result depends on the mod-Gaussian convergence for characteristic polynomials on U(N) 
(i.e., on Example [[]). Indeed, denoting by U(N)$ the space of conjugacy classes in U(N), we 
have the following: 

Theorem 11. For any integer d > 5, any odd prime p with p\ d(d — 1), the conjugacy classes 

{Qx,9d,t(p) I X(modp), x ¥= !, andt£ F p with - (1 - d) d_1 / O(modp)} 

become equidistributed in U(d — 1)" as p —> +oo, with respect to Haar measure. 

Proof. This is an easy consequence of results of Katz (see [18} Th. 5.13]), the only "twist" 
being the extra averaging over all non-trivial Dirichlet characters to obtain unitary instead of 
special unitary (or similar) equidistribution. 

First of all, it is easy to check that if p \ d(d — 1) and t G F p is such that t d ~ x ^ (1 — d)^ 1 , 
the polynomial g^t = X d — dX — t G F P [X] is a strong Deligne polynomial in one variable (in 
the language of [18], these are called "weakly-supermorse" polynomials). Hence the conjugacy 
classes in the statement are well-defined. 

For simplicity, denote U the open subset of the affine i-line where ^ (1 — d) . 

Now, according to the Weyl equidistribution criterion, we must show that 

1 * 1 

lim V 7T77^--7 V TrA(6L , t (p)) = 0. 

for any (fixed) non-trivial irreducible unitary representation A of the compact group U(d — 1). 

We isolate in the sum those characters x where x 2d = 1 ; there are at most 2d of them. 
For any other character x( m °dp), the inner sum over t G U(F p ) is of the type handled by 
the Deligne Equidistribution Theorem. Let k = k{\) be the order of the Dirichlet character 
XX2, where X2 is the real character modulo p. By Th. 5.13, (2)] (the restriction x 2d ^ 1 
ensures the assumptions hold), provided p \ d(d — 1), e.g., p > d(d — 1), the relevant geometric 
monodromy group is equal to 

GL k (d -l) = {ge GL(d - 1) | det( 5 ) fe = 1}, 

with maximal compact subgroup 

U k (d - 1) = {g G U{d - 1) | det( 5 ) fc = 1}. 

For simplicity, we write U = U(d — 1), Uk = U k (d — 1). The restriction of A to U k is a 
finite sum of irreducible representations of this group (possibly including trivial components). 
Applying [2H Th. 9.2.6, (5)] to each of the non-trivial ones (and the obvious identity for the 
trivial components), we find that 

— l — Y, TrA(^(p)) = (A|C/ fc ,l)+0((dimA)dp- 1 /2) 
1 1 p)l teit(F p ) 

where the implied constant is absolute (this is because we have a one-parameter family, so we 
can apply |21|. 9.2.5] and the fact proved in |18l 5.12] that the relevant sheaf is everywhere tame, 
so the Swan-conductor contribution is zero; the parameter curve U has d points at infinity, which 
give the factor d above). 

Using Frobenius reciprocity or direct integration (using, e.g, [21, Lemma AD. 7.1]), we find 
that the multiplicity of the trivial representation in A (restricted to Uk) satisfies 



(A|^,l) = ^(A,det(-) /l ^ 
hez 



if A = det{-) hk for some h G Z - {0}, 
otherwise. 
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For a given A, this is equal to 1 only if dim A = 1, so A = det(-) r for some r G Z — {0}, and if 
X is such that k(x) \ r. The number of such characters x is therefore <C 1, the implied constant 
depending on A. Hence we find, after adding back the characters with x 2d = 1> that 

1 v-^* 1 v-^ rr. a , n , w d (dimA)d 

where the implied constant depends on A. This confirms the claimed equidistribution. □ 

Proof of Theorem\l(A This an easy consequence of Theorem lilt for any open set V C C, we 
have first that 

|U(P p )|~p 
as p goes to infinity, and then we can write 

limmf—^\{( X ,t)eX( P ) | xVl, L$, gdi ux)eV}\ 

p^+oo \X(p)\ 

> tid-l({9 e U{d - 1) | det(l -g)€ V}), 
and then we apply Theorem □ 

Remark 10. In Theorem II 11 we performed the average over x, because for "standard" families 
of exponential sums (those parametrized by points of algebraic varieties), the (connected com- 
ponent of the) geometric monodromy group is always semisimple, so its center is finite and its 
maximal compact subgroup can never be U(N). However, one may expect that t could be fixed 
in the example above, and that (for instance) the conjugacy classes 

{6 x (p) = 9 X: x d ~dX-i I X(modp) non-trivial} 
corresponding to the exponential sums 

s(x) = E *( xd - dx - l ) 

xeF p 

(where d > 6, p \ d(d — 1)) would be already equidistributed in U(d — 1)" as p — > +oo. 

Very recently, N. Katz [20J has indeed shown that for such families there is always an a-priori 
"Sato- Tate law", i.e., that the conjugacy classes become equidistributed in K} for some compact 
group K C U{d— 1). In fact, in this work, Katz proves the required equidistribution for (most) 
fixed t in the vertical direction where one looks at characters of F q m for fixed q and m — > +oo 
(see [Ms Th. 7.2, Th. 17.6, Remark 17.7]). It seems quite possible that the "horizontal" 
direction we are interested in will also follow from these new techniques. 

Similarly, it is likely that families of hyper-Kloosterman sums of certain types exhibit full 
unitary monodromy, e.g., for any integer n ^ 1, any additive character ip : F q — > C x , any 
multiplicative character x '■ — )• C x , one can consider the sums 

X{vi)i>{xi H h 

zi,...,x n eFq 

Xl---X„ = l 

for which the basic theory (due to Deligne §7]) shows that the associated L-function (unitarily 
normalized so the central point is s = 0) is 

det(l - q- s 9^ x (q)) 

for some unique conjugacy class 9^ >x (q) £ U{n — 1). One would then expect (this was suggested 
by Katz, and is again potentially in the realm of his recent work [20]) that if we fix a character 
■ip for F p (e.g., ip(x) = e{x/p)) and define ip m (x) = ip(TrY p m/F p ( x )) f° r an rn ^ 1, the sets of 
conjugacy classes 

{Oip m ,x(P m ) I X : F^C*} 
become equidistributed in U(n — 1)" as m — > +oo. 
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4.2. Symplectic symmetry. A typical example of symplectic symmetry involves families of 
L-functions of algebraic curves over finite fields. For simplicity, we will consider one of the 
simplest ones, but we first start by proving distribution results for characteristic polynomials 
of symplectic matrices, which are of independent interest. 

We first remark that for A G USp(2g,C), the characteristic polynomial can be expressed in 
the form 

det(l -TA)= Yl (l-e^T)(l-e-^T) 
for some eigenangles 6j, 1 ^ j ^ g, and it follows that 



det(l - A) 



n 



l_ e *J)| 2 > 0. 



This positivity is reflected in a shift in expectation in the mod-Gaussian convergence (it also 
means that the argument is not an interesting quantity here). We obtain: 

Proposition 12. For g ^ 1, let 

X g = logdet(l-r ff )-ilog(f ), 

where T g is a Haar- distributed random matrix in the unitary symplectic group U Sp(2ji, C) . Then 
X g converges in mod-Gaussian sense with Q g (t) = (log i<?)i 2 and limiting functio 



G(f + 



Indeed, we have 



exp(-(logi 5 )t 2 /2)^ P (t)(l + 



1 + 1*1 



for \t\ ^ g 1 / 6 , where the implied constant is absolute. 

Figure 2 is a graph illustrating the logarithmic growth of 4- log |<l?5p(i)|. 



(27) 
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Figure 2. Graph of 4 log \$>s P {t)\, 1 < t ^ 140 



The expressions in 123, (32), (67)] and Cor. 4.2] are rather more complicated, but of course they are 
equivalent. 
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Proof. (Compare [15j Prop. 4.9]) Keating-Snaith [23J (10)] compute that 

Efe itiogdet ( i-r s h = 2 2 9 u A ni+g+mh+it+j) 
[ 1 fink+jw+it+g+j) 



J 

which, together with the formula ([23]) . gives 

'3\ nt?> 



E(e itx a) = (*9\-^ G(l) ^ u G(l + it + g )G(2 + 2g)G(2 + it + g) 
{ ' \2J G{\ + it) G(l+g)G(2 + g)G(2 + it + 2g) 

By applying Proposition [T71 (3) in the Appendix, we get 



E(e*"«)= $ Sp (t) 1 + 



,27 "^'V V 5 

as claimed. □ 

In particular, the Central Limit Theorem for det(l — T g ) takes the form of the convergence 
in law 

log det(l -T g )-\ log ^ i aw 



(log( 5 /2))V2 

so, for any a < b, we have 



(standard Gaussian), 



.(P) 1/ VVW)<det(l-T 5 )< f^) 1/2 e^ h. _L /* 
V V 2 / V 2 / J \/2it J a 



e- t2 / 2 dt. 



On the other hand, by applying Theorem [U as we can according to the previous proposition, 
we can control the probability of the values of det(l —T g ) in much smaller (dyadic or similar) 
intervals: 

Corollary 13. Let U =]a,b[ with a <b real numbers. We have 



p(e«(f) 1/2 <det(l-T 9 )<^(f) 1/2 )=^^/ exp(--V 
V V 2 ' \2) ) J 2n l og y J a V 21ogi ff 



dt 



/max(6 — a, (b — a) 1 ) max(l,fc — a) \ 
+ V bg^ + (log^/a+i^gj 



for g ^ 2, where the implied constant is absolute. In particular 



+ (noKo)V 



. (log 5 ) V2+1/29. 

Proof. If 6 — a ^ 1, we apply Theorem [H with the constants fj, = 1, A arbitrarily close to 2, a 
arbitrarily large and C = 1/2, so that D can be any number with 

D > 2(1 + 1 + 12) = 28, 

and in particular D = 29 is valid. If b — a > 1, we split the interval ]a, b[ into 2 \b — a] intervals 
of length 

1 b-a 
4 ^ 2\b-a\ 

and apply the previous case to the interior of those intervals. Since the joint distribution of 
eigenvalues of T g is absolutely continuous with respect to Lebesgue measure, the probability 
of falling on one of the missing endpoints is zero, and summing over these intervals gives the 
result. □ 
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We now deduce an arithmetic corollary, using families of hyperelliptic curves over finite fields. 
For any odd q, any integer g ^ 1 and any squarefree monic polynomial / G F 9 [X] of degree 
2g + 1, let Cj be the smooth projective model of the affine hyperelliptic curve 

C f : y 2 = f(x). 
The number of F^m-rational points on Cf satisfies 

\ Cf (F qm )\=q m + l- £ X2(N Fqm/Fq (f(x))) = q m + l-S m (x2j) 

x£F q m 

where \2 is the quadratic character of F* and the notation is as in Section 14.11 The associated 
L-function (the numerator of the zeta function) is defined by 

L(Cf,s) = L(s,f,X2), 

or, in other words, we have 

L(C f ,s) = Z(C f ,q- s ), Z(C f ,T) = exp(j2 5m( * 2 ' /} r™). 

Weil proved that Z(Cf,T) is a polynomial in Z[T], of degree 2g, all roots of which have 
modulus y/q, and which is symplectic: there is a unique conjugacy class 6f(q) in USp(2g,C) 
such that 

L(C f ,s+l)=det(l-q- s e f (q)). 

Theorem 14. Let % g (F q ) be the set of squarefree, monic, polynomials of degree 2g + 1 in 
Fg[X]. Fix a non-empty open interval }a, /3[c]0, +oo[. For all g large enough, we have 



lim inf ■ 



g^+oo \"K a (F a 



if e X g (F p ) | L{C ^ ] G]a,/3[} » - 7f ^. (28) 
1 VW 2 J A/loeo 



V lo g 9 ' 



(Note that this is in fact a very weak version of what we can prove) . 



Proof. Let first 5C*(F 5 ) be the set of / G 5C fl (F,) for which L(C f , 1/2) / 0. In [El Prop. 4.9], 
we showed, using the relevant equidistribution computation in |21|, 10.8.2] that the (real-valued) 
random variables 

L g = logdet(l-M<?))-ilog(Tr), 
on JC*(F g ) (with counting measure) converges in law to 

X fl = logdet(l-T ff )-ilog(f ), 

where T g is a random matrix in the unitary symplectic group USp(2g, C), distributed according 
to Haar measure. The previous proposition shows that Theorem H] is applicable to X g with 
covariance Q g (t) = (log \g)t 2 and limiting function $s p (t). Letting q — > +oo as in the previous 
section, we get 

for g large enough. Since 

\2C B (F q )\ = \X g (F q )\(l + o(l)) = ^ +1 (1 + o(l)) 

for fixed g and q — > +oo (by an easy application of the equidistribution, see [151 Prop. 4.9]), 
we get the result stated by exponentiating. □ 

Remark 11. The lower bound (|28p is good enough to combine with various other statements 
proving arithmetic properties of L-functions which hold for "most" hyperelliptic curves. For 
instance, from \25\ Prop. 1.1] (adapted straightforwardly to all hyperelliptic curves instead of 
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special one-parameter families), it follows that if we denote by Ji g (F q ) the set of / £ "K g {F q ) 
such that the eigenvalues of Of(q) satisfy no non-trivial multiplicative relationjf] then we have 

\{feX g (F q ) | / i X g (F q )}\ «, q 1 -^ 
for some 7 = 7(3) > 0, and hence we get 



lim inf ■ 



l 

> 



?-*+oo |JC fl (F,) 
for 5 large enough. 

4.3. Orthogonal symmetry. Orthogonal symmetry, in number theory, features prominently 
in families of elliptic curves. In contrast with symplectic groups, there are a number of "flavors" 
involved, due to the "functional equation" 

T N det(l - T~ l A) = det(-A) det(l - TA) 

for an orthogonal matrix A £ 0(N, R) (the standard maximal compact subgroup of the or- 
thogonal group 0(N, C)), which implies that det(l — A) is zero for "trivial" reasons if N is 
even and det(^4) = — 1 or N is odd and det(A) = 1. When this happens, it is of great interest 
to investigate the distribution of the first derivative at T = 1 of the reversed characteristic 
polynomial. For simplicity, however, we restrict our attention here to N even and matrices with 
determinant 1, i.e., to the subgroup SO(2iV, R) of 0(2N, R), where N ^ 1. In that case, it is 
also true that eigenangles come in pairs of inverses, and therefore we have det(l — A) ^ 0. 
As in the previous section, we start with random matrix computations. 

Proposition 15. For N ^ 1, let 

X JV = logdet(l-r JV )-|log(f ), 

where T/v is a Haar- distributed random matrix in the special orthogonal group SO(2N, R). Then 
Xj\f converges in mod- Gaussian sense with Qat(^) = (l°g \N)t 2 and limiting function 

**° {t) = l0wy (29) 

Indeed, we have 

E(e^) = exp(-(log \N)t 2 /2)<S> so {t) (l + 

for \t\ ^ iV 1 / 6 , where the implied constant is absolute. 
Proof. Using [231 (56)] and ([23]), we get 

jv it\ogdet(l-T N )\ _ 2^Nit TT r(iV + j- l)T(it + j - \) 

y ' f = \T(j-l)r(it + N + j-iy 

G{\) ^ o2NU G(±+it + N)G(2N)G(it + N) 



G{\ + it) G{\ + N)G{N)G{it + 2N) 

and by applying Proposition [TTJ, (4) in the Appendix, we get the desired formula 



2) w \ V JV 



□ 



Remark 12. If we compare with the symplectic case, we observe the (already well-established) 
phenomenon that the value det(l — ^4), for A G SO(2N, R) tend to be small, whereas they tend 
to be large for symplectic matrices in USp(2g, C). 



® An analogue of the hypothetical statement of Q-linear independence of the ordinates of zeros of the Riemann 
zeta function; non-trivial refers to a relation that can not be deduced from the fact that, if e lS is an eigenvalue, 
so is its inverse e~ %e . 
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Our arithmetic corollary is based on families of quadratic twists of elliptic curves over function 
fields, and we select a specific example for concreteness (see [24, §4]); the basic theory, which 
we illustrate here, is again due to Katz [19] . 

For any odd prime power q ^ 3, any integer N ^ 1, we consider the elliptic curves over the 
functional field F q (T) given by the Weierstrass equations 

£ 2 : Y 2 = (T N — NT — 1 — z)X{X + 1){X + T), 

where z £ is a parameter such that z is not a critical value of T N — NT — 1. 

Katz proved that the associated L-function (which is now defined by the "standard" Euler 
product over primes in F 9 [T], with suitable ramified factors) is of the form 

L(E z ,s + l) = det(l - 9 z (q)Q~ s ) 
where 6 z (q) is a unique conjugacy class in 0(2N,H). 

Theorem 16. Fix a non-empty open interval ]a,/3[c]0, +oo[. For all N large enough, we have 

/iV\l/2. 



lim inf — 

p^+oo p 
(p-l,7V-l)=l 



£(£,,1/2) E]a,/3[} 



> 



Proof. As recalled in |24} Cor. 4.4 and before], for all iV 146 and primes p with p \ N{N — 
1)(N + 1) and (p — 1, N — 1) = 1, the conjugacy classes 6 z (p), for z £ F p not a critical value, 
become equidistributed in 0(2N, R)f for the image of Haar measure (precisely, this is stated 
for the "vertical direction" where p is fixed and finite fields of characteristic p and increasing 
degree are used; however, because the parameter variety is a curve with N + 1 points at infinity 
and the relevant sheaf is tame, we can recover the horizontal statement as in the proof of 
Theorem 1 11|) . In particular, there is a subset ^cFj, with \V P \ ~ p/2 where det(9 z (p)) = 1 and 
those restricted conjugacy classes become equidistributed in SO(2N, R)*. Hence, for N large 
enough, we get 



lim inf — — 

p— >+oo \Vp\ 

(p-l,iV-l)=l 



z e V p 



AT \ 1/2 
8^ 



L(£ z ,l/2) G]a,/3[} 



> Vso(2N,n)(i A I logdet(l-A) G]loga,log/3[}) » -= _ , 
as desired. □ 

Remark 13. Obviously, this result (or its generalizations to other families of quadratic twists 
over function fields) has interesting consequences concerning the problem of the distribution of 
the order of Tate-Shafarevich groups of the associated elliptic curves, through the Birch and 
Swinnerton-Dyer conjecture (which is known to be valid in its strong form for many elliptic 
curves over function fields over a finite field with analytic rank or 1). We hope to come back 
to this question, and its conjectural analogue over number fields, in another work. 

Appendix: estimates for the Barnes function 

We present in this appendix some uniform analytic estimate for the Barnes function, which 
are needed to verify the strong convergence assumption ([7]) for sequences of random matrices 
in compact classical groups. Note that we did not try to optimize the results. 

Proposition 17. (1) For all z £ C and n ^ 1 with \z\ ^ n 1//6 , we have 

G( j, + Z + n) = (2vr)^ 2 e -(™ +1 ^(l + nf' 2+nz ( 1 + 0(^±^) ) . (30) 
G(l + n) v \ n / / 

(2) For all N ^ 1 and all t = (h,t 2 ) G R 2 with \\t\\ ^ N 1 / 6 we have 



G(l + ih + N)G(l + N) Ar -( t?+t2 )/4A, , „a+\\t 



+ iti^h + N)G(1 + + N) V V N 
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A T-(t?+t 2 )/4^ 1 



+ 



1 3 



(3) For all g ^ 1 and all t G R with \t\ ^ <? 1//6 we have 

22qit G(l + it + g)G(2 + 2g)G(2 + it + g) _ /gyt 2 1 2 / fayt , + 
G(|+ 5 )G(2 + <7)G(2 + it + 2 ff ) V2/ V\/ 2 / V V g 

(4) For all N ^ 1 and allteK with \t\ ^ iV 1 / 6 we /ioue 
2Mt G(| + it + iV)G!(2JV)G(it + JV) fN\-#/*( f^y*^ , ^l + |t| 



2 ZJV8t -^__ 7 v i = _ W— l + O 

G(± + iV)G(g)G(it + 2JV) V2/ WiV/ V V iV 

In estimates, the implied constants are absolute. 

Proof. One can use the asymptotic expansions in [11], but we follow instead the nice arrange- 
ment of the Barnes function in |10} Cor. 3.2], which leads to a quicker and cleaner proof. 
(1) First, the ratio of Barnes function is well-defined since n ^ 1. We now use the formula 



G{l + z + n) 



(27r) z/2 e- (n+1)2 (l + n) z2 / 2+nz S n (z), (31) 



G(l + n) 
where 

i iwo -i-r / Z\ k ~ n ( \\z 2 /2+nz 

s„ W = H + (1 + 5) 

which is valid for zeC, n)l (see [10], p. 241]). 

If we expand the logarithm, defined using the Taylor expansion of log(l + w) at the origin, 
we have 

iv 2 

log(l + w) =w — + 0(w~ 3 ) 

for | w | ^ 1/2, with an absolute implied constant. Hence we obtain 

z 2 

k) ' 2~~°V ' k 



^S n (z) = -^1 + £ (*]flg(l + £)+!■ log(l + \ 

k>n 

+ ^n(zlog(l + i) -log(l + | 

k>n 

z(z — 1) z 2 sr^ 1 z(z — 1) 1 ^/sr^/z 2 z 



2 2 ^ k 2 ' " 2 ^ A; 2 ' ~ Vfc 3 + A; 2 

fe>n fc>n fe>n 

with an absolute implied constant, for n ^ 1 and \z\ ?J ra/2, hence for |z| ^ n 1 / 6 we get 

(„2 I 3 
n 

since 



A; 2 n \n 2 /' A; 3 2n 2 \n i 



for n ^ 1, with absolute implied constants. Hence, we have 

S n (z) = 1 + 



z 2 + z 3 
n 

for |z| ^ n 1 / 6 , for some absolute implied constant, and we get the stated formula 

G(l + Z + n) ,„ u/2 -(n+l)z/i i „\z 2 /2+Wi , / ^ + ^ ' 



(2ny/ 2 e-^ z (l + n) z2 l 2+nz (l + o(^t£_) ) . 



G(l + n) V V n 

(2) Note first that the conditions N ^ 1 and < iV 1 / 6 ensure that the values of the Barnes 
function in the denominator are non-zero. Next, let u = (it\ — t2)/2, v = (iti + *2)/2; we can 
express the ratio of Barnes function as 

G{1 + u + v + N)G{1 + N) _ G{l + u + v + N) G(l + N) G(l + N) 
G(l + u + N)G(l + v + N) ~ G(l + N) G(l + u + N) G(l + v + N)' 
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and ||t|| N 1 / 6 gives \u\, \v\ ^ iV 1 / 6 , allowing us to apply f|30[) three times. The exponential 
terms cancel out, leading to 

G(l + u + v + N)G(1 + N) M , An _i|tll 2 /4A, , ^/|t| 2 + |i|' 



;i + iY)- |l<l|2/4 (i + o( J 



G(l + u + iV)G(l + u + N) v y V V N 

which gives the first part of the proposition. 

(3) We use a similar computation, applying ([30]) six times with the parameters (n, z) 

(2<?,1), (g,l + it), (g,±+it), (g,l), (2<?,1 + it), (g, ±), 
leading, after an easy calculation, to a main term 

(27r) it / 2 e" i *(l + ^) - * 2 +3^/2+2i<?i( i + 2gf /2-it-2igt 

for the ratio of Barnes functions, and some further computation leads to the stated result (each 
parameter z has \z\ ^ \t\ + 1, so the error term is also as given). 

(4) We argue exactly as in the previous case, with parameters (n, z) given now by 

(2JV-1,0), (N-1,0), (JV- 1,^+1), (JV-1, it), (2N-l,it), (iV-l,|), 
and we get a main term 

(27r) ii / 2 A r ~* 2 ~ 3i */ 2+2 * iV * (2Ny 2 / 2+it ~ 2itN = 2* 2 / 2 + u ~ 2itN (27T)*'/ 2 TV"' 2 '/ 2 ^ it / 2 
which leads to the conclusion. □ 
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